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Abstract 

Background: Many organisms coordinate cell growth and division through size control mechanisms: cells must 
reach a critical size to trigger a cell cycle event. Bacterial division is often assumed to be controlled in this way, but 
experimental evidence to support this assumption is still lacking. Theoretical arguments show that size control is 
required to maintain size homeostasis in the case of exponential growth of individual cells. Nevertheless, if the growth 
law deviates slightly from exponential for very small cells, homeostasis can be maintained with a simple 'timer' 
triggering division. Therefore, deciding whether division control in bacteria relies on a 'timer' or 'sizer' mechanism 
requires quantitative comparisons between models and data. 

Results: The timer and sizer hypotheses find a natural expression in models based on partial differential equations. 
Here we test these models with recent data on single-cell growth of Escherichia coli. We demonstrate that a 
size-independent timer mechanism for division control, though theoretically possible, is quantitatively incompatible 
with the data and extremely sensitive to slight variations in the growth law. In contrast, a sizer model is robust and fits 
the data well. In addition, we tested the effect of variability in individual growth rates and noise in septum positioning 
and found that size control is robust to this phenotypic noise. 

Conclusions: Confrontations between cell cycle models and data usually suffer from a lack of high-quality data and 
suitable statistical estimation techniques. Here we overcome these limitations by using high precision measurements 
of tens of thousands of single bacterial cells combined with recent statistical inference methods to estimate the 
division rate within the models. We therefore provide the first precise quantitative assessment of different cell cycle 
models. 

Keywords: Cell cycle. Bacteria, Division, Size control. Structured population equations. Numerical simulations, 
Nonparametric estimation 



Background 

Coordination between cell growth and division is often 
carried out by size control' mechanisms, where the cell 
size has to reach a certain threshold to trigger some 
event of the cell cycle, such as DNA replication or cell 
division [1]. As an example, the fission yeast Schizosaccha- 
romyces pombe exhibits a size threshold at mitosis [2,3]. 
The budding yeast Saccharomyces cerevisiae also uses a 
size control mechanism that acts at the Gl-S transition 
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[4,5]. In contrast, in some cells such as those of early frog 
embryos, progression in the cell cycle is size independent 
and relies on a 'timer' mechanism [6]. 

Bacterial division is often assumed to be under size con- 
trol but conclusive experimental evidence is still lacking 
and the wealth of accumulated data presents a com- 
plex picture. In 1968, building on the seminal work of 
Schaechter et al. and Helmstetter and Cooper, Donachie 
suggested that initiation of DNA replication is triggered 
when the bacterium reaches a critical size [7-9]. This pro- 
vided the basis for a long-standing model of size control 
where cell size triggers replication initiation, which in turn 
determines the timing of division (see [10] and references 
therein). However, the coupling of replication initiation 
to cell mass has been repeatedly challenged [11-13]. In 
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particular, on the basis of recent single-cell analysis, the 
team headed by N Kleckner proposed that replication ini- 
tiation is more tightly linked to the time elapsed since 
birth than to cell mass [13,14]. In addition, the extent to 
which initiation timing affects division timing is unclear. 
In particular, variations in initiation timing are known to 
lead to compensatory changes in the duration of chro- 
mosome replication (see [15-17] and references therein). 
These studies argue against a size control model based 
on replication initiation. Another model postulates that 
size control acts directly on septum formation [18,19]. 
Nevertheless, the nature of the signals triggering the for- 
mation of the septal ring and its subsequent constriction 
are still unknown [17,20] and no molecular mechanism is 
known to sense cell size and transmit the information to 
the division machinery in bacteria. 

Besides the work of Donachie, the assumption of size 
control in bacteria originates from a theoretical argu- 
ment stating that such a control is necessary in expo- 
nentially growing cells to ensure cell size homeostasis, 
i.e. to maintain a constant size distribution through suc- 
cessive cycles. The growth of bacterial populations has 
long been mathematically described using partial dif- 
ferential equation (PDE) models. These models rely on 
hypotheses on division control: the division rate of a 
cell, i.e. the instantaneous probability of its dividing, can 
be assumed to depend either on cell age (i.e. the time 
elapsed since birth) or cell size. In the classical sizer' 
model, the division rate depends on size and not on age 
whereas in the 'timer' model it depends on age and not 
on size. Mathematical analysis of these models sheds light 
on the role of size control in cell size homeostasis. In 
particular, it has been suggested that for exponentially 
growing cells, a timer mechanism cannot ensure a sta- 
ble size distribution [21,22]. Nevertheless, this unrealistic 
behavior of the timer mechanism is based on a biolog- 
ically meaningless assumption, namely the exponential 
growth of cells of infinitely small or large size [23,24]. 
Cells of size zero or infinity do not exist and particularly 
small or large cells are likely to exhibit abnormal growth 
behavior. In conclusion, the mathematical arguments that 
were previously developed are insufficient to rule out 
a size-independent, timer model of bacterial division: 
quantitative comparisons between models and data are 
needed. 

In the present study, we test whether age (i.e. the time 
elapsed since birth) or size is a determinant of cell divi- 
sion in E. coli. To do so, we analyzed two datasets derived 
from two major single-cell experimental studies on E. coli 
growth, performed by Stewart et al. [25] and Wang et al 
[26]. Our analysis is based on division rate estimation 
by state-of-the-art nonparametric inference methods that 
we recently developed [27,28]. The two datasets corre- 
spond to different experimental setups and image analysis 



methods but lead to similar conclusions. We show that 
even though a model with a simple timer triggering divi- 
sion is sufficient to maintain cell size homeostasis, such 
a model is not compatible with the data. In addition, our 
analysis of the timer model shows that this model is very 
sensitive to hypotheses regarding the growth law of rare 
cells of very small or large size. This lack of robustness 
argues against a timer mechanism for division control in 
E. coli as well as in other exponentially growing organisms. 
In contrast, a model where cell size determines the prob- 
ability of division is in good agreement with experimental 
data. Unlike the timer model, this sizer model is robust to 
slight modifications of the growth law of individual cells. 
In addition, our analysis reveals that the sizer model is 
very robust to phenotypic variability in individual growth 
rates or noise in septum positioning. 

Results and discussion 

Description of the data 

Age and size distribution of the bacterial population 

The results reported in this study were obtained from 
the analysis of two different datasets, obtained through 
microscopic time-lapse imaging of single E, coli cells 
growing in a rich medium, by Stewart et al [25] and Wang 
et al [26] . Stewart et al followed single E, coli cells grow- 
ing into microcolonies on LB-agarose pads at 30°C. The 
length of each cell in the microcolony was measured every 
2 min. Wang et al grew cells in LB: Luria Bertani medium 
at 37°C in a microfluidic setup [26] and the length of the 
cells was measured every minute. Due to the microflu- 
idic device structure, at each division only one daughter 
cell could be followed (data 5/: sparse tree), in contrast 
to the experiment of Stewart et al. where all the indi- 
viduals of a genealogical tree were followed (data/^-: full 
tree). It is worth noting that the different structures of the 
data 75 and 5/ lead to different PDE models, and the sta- 
tistical analysis was adapted to each situation (see below 
and Additional file 1). From each dataset (// and Si) we 
extracted the results of three experiments (experiments 
/i,/2 and/s and 5i,52 and 53). Each experiment y^- corre- 
sponds to the growth of approximately six microcolonies 
of up to approximately 600 cells and each experiment 
Si to the growth of bacteria in 100 microchannels for 
approximately 40 generations. 

Given the accuracy of image analysis, we do not take 
into account variations of cell width within the popula- 
tion, which are negligible compared to cell-cycle-induced 
length variations. Thus, in the present study we do not 
distinguish between length, volume and mass and use 
the term cell size as a catch-all descriptor. Cell age and 
cell size distributions of a representative experiment from 
each dataset are shown in Figure 1. These distributions 
are estimated from the age and size measurements of 
every cell at every time step of a given experiment fi or 
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Si, by using a simple kernel density estimation method 
(kernel estimation is closely related to histogram con- 
struction but gives smooth estimates of distributions, 
as shown in Figure 1, for instance; for details see the 
Methods and Additional file 1). As expected for the dif- 
ferent data structures (full tree fi or sparse tree 5/) and 
different experimental conditions, the distributions for 
the two datasets are not identical. The age distribution 
is decreasing with a maximum for age zero and the 
size distribution is wide and positively skewed, in agree- 
ment with previous results using various bacterial models 
[29-31]. 

Testing the timer versus sizer models of division 
Age-structured (timer) and size-structured (sizer) models 

The timer and sizer hypotheses are easily expressed in 
mathematical terms: two different PDE models are com- 
monly used to describe bacterial growth, using a divi- 
sion rate (i.e. the instantaneous probability of division) 
depending either on cell age or cell size. In the age- 
structured model (Age Model) the division rate is a 
function only of the age a of the cell. The density n(t, a) 
of cells of age a at time t is given as a solution to the 
Mckendrick-Von Foerster equation (see [32] and refer- 
ences therein): 

—n(t, a) + —n(t, a) = -B^(a)n(t, a) (1) 
at da 

with the boundary condition 

poo 

n(tya = 0) = 2 / B2,(a)n(tya)da 
Jo 

In this model, a cell of age a at time t has the probability 
B2,{a)dt of dividing between time t and t + dt. 

In the size-structured model (Size model), the division 
rate is a function only of the size x of the cell. Assuming 
that the size of a single cell grows with a rate v{x)y the den- 



sity x) of cells of size x at time t is given as a solution 
to the size-structured cell division equation: [32] 

—n(t,x) + — {v{x)n{t,x)) = -Bs(x)n(t,x) 

dt dx (2) 

-\- 4Bs(2x)n(t, 2x) 

In the Size Model, a cell of size x at time t has the prob- 
ability ^s(«^)'^^ of dividing between time t and t + dt. This 
model is related to the so-called sloppy size control model 
[33] describing division in S. pombe. 

For simplicity, we focused here on a population evolv- 
ing along a full genealogical tree, accounting for fi data. 
For data 5/ observed along a single line of descendants, an 
appropriate modification is made to Equations (1) and (2) 
(see Additional file 1: Supplementary Text). 

Testing the Age Model (timer) and the Size Model (sizer) with 
experimental data 

In this study we tested the hypothesis of an age-dependent 
versus size-dependent division rate by comparing the abil- 
ity of the Age Model and Size Model to describe exper- 
imental data. The PDE given by Equations (1) and (2) 
can be embedded into a two-dimensional age-and-size- 
structured equation (Age & Size Model), describing the 
temporal evolution of the density n(t,a,x) of cells of age 
a and size x at time t, with a division rate 5a,s ^ priori 
depending on both age and size: 

/ a a \ a 

\ nit, a,x) -\ (vix) nit, a, x)) = 

\dt da) ^ ^ dx^ ^ (3) 

— B2,,s{a, x)n(t, a, x) 
with the boundary condition 

' ^a,s 2x) nit, a, 2x) da 

0 

In this augmented setting, the Age Model governed 
by the PDE (1) and the Size Model governed by (2) are 
restrictions to the hypotheses of an age-dependent or 
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size-dependent division rate, respectively (5a,s = or 

The density n{t,a,x) of cells having age a and size x 
at a large time t can be approximated as n(t,a,x) ^ 
e^^N{ayX)y where the coefficient A. > 0 is called the 
Malthus coefficient and N{ay x) is the stable age-size dis- 
tribution. This regime is rapidly reached and time can 
then be eliminated from Equations (1), (2) and (3), which 
are thus transformed into equations governing the stable 
distribution A/' Importantly, in the timer model (i.e. 

3 = B^), the existence of this stable distribution requires 
that growth is sub-exponential around zero and infinity 
[23,24]. 

We estimate the division rate of the Age Model using 
the age measurements of every cell at every time step. 
Likewise, we estimate the division rate of the Size 
Model using the size measurements of every cell at every 
time step. Our estimation procedure is based on math- 
ematical methods we recently developed. Importantly, 
our estimation procedure does not impose any particu- 
lar restrictions on the form of the division rate function 
B, so that any biologically realistic function can be esti- 
mated (see Additional file 1: Section 4 and Figure S6). In 
Additional file 1: Figures SI and S2, we show the size- 
dependent and age-dependent division rates Bs(x) and 
Bsi(a) estimated from the experimental data. Once the 
division rate has been estimated, the stable age and size 
distribution N(a, x) can be reconstructed through simu- 
lation of the Age & Size Model (using the experimentally 
measured growth rate; for details see the Methods). 

We measure the goodness-of-fit of a model (timer or 
sizer) by estimating the distance V between two distri- 



butions: the age-size distribution obtained through sim- 
ulations of the model with the estimated division rate 
(as explained above), and the experimental age-size dis- 
tribution. Therefore, a small distance V indicates a good 
fit of the model to the experimental data. To estimate 
this distance we use a classical metric, which measures 
the average of the squared difference between the two 
distributions. As an example, the distance between two 
bivariate Gaussian distributions with the same mean and 
a standard deviation difference of 10% is 17%, and a 25% 
difference in standard deviation leads to a 50% distance 
between the distributions. The experimental age-size dis- 
tribution is estimated from the age and size measurements 
of every cell at every time step of a given experiment// or 
Si, thanks to a simple kernel density estimation method. 

Analysis of single-cell growth 

As mentioned above, to avoid unrealistic asymptotic 
behavior of the Age Model and ensure the existence of 
a stable size distribution, assumptions have to be made 
on the growth of very small and large cells, which cannot 
be exactly exponential. To set realistic assumptions, we 
first studied the growth of individual cells. As expected, 
we found that during growth, a cell diameter is roughly 
constant (see inset in Figure 2A). Figure 2 A shows cell 
length as a function of time for a representative cell, sug- 
gesting that growth is exponential rather than linear, in 
agreement with previous studies [25,26,34-36]. To test this 
hypothesis further, we performed linear and exponential 
fits of cell length for each single cell. We then calculated 
in each case the coefficient of determination, which is 
classically used to measure how well a regression curve 




Age (min) Cell length (|jm) 

Figure 2 Analysis of single-cell growth. (A) Cell length vs cell age for a representative cell (black dots); exponential fit (red curve) and linear fit 
(black line). Inset: Cell width vs cell age for the same cell. (B) Increase in cell length during one time step (i.e. 1 min) as a function of cell length for fj 
data. During the lifetime of a cell, the cell length is measured at each time step and the increase in cell length between successive time steps is 
calculated. Black dots are the average length increase for every cell of a given experiment fi , as a function of cell length; error bars are the average 
+/ - 2 SEM (standard error of the mean). The red line is a linear fit for lengths between 2.5 pm and 4.5 pm. Inset: For each single cell of f], the 
evolution of cell length with age was fitted with a linear or an exponential function (as shown in panel A). We thus obtain a distribution of 
coefficients corresponding to the linear (green) and exponential (red) fits. 
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approximates the data (a perfect fit would give = 1 and 
lower values indicate a poorer fit). The inset of Figure 2B 
shows the distribution of the coefficient for all single 
cells for exponential (red) and linear (green) regressions, 
demonstrating that the exponential growth model fits the 
data very well and outperforms the linear growth model. 
We then investigated whether the growth of cells of par- 
ticularly small or large size is exponential. If growth is 
exponential, the increase in length between each mea- 
surement should be proportional to the length. Therefore, 
we averaged the length increase of cells of similar size 
and tested whether the proportionality was respected for 
all sizes. As shown in Figure 2B, growth is exponential 
around the mean cell size but the behavior of very small 
or large cells may deviate from exponential growth. We 
therefore determined two size thresholds Xmin and Xmax 
below and over which the growth law may not be expo- 
nential (e.g. for the experiment /i shown in Figure 2B, we 
defined 

2ind Xmax = 5.3 |im). 

The age-size joint distribution ofE. coli corresponds to a 
size-dependent division rate 

We used both the Age Model and Size Model to fit 
the experimental age-size distributions, following the 
approach described above. The growth law below Xmin and 
above Xmax is unknown. Therefore, to test the Age Model, 
growth was assumed to be exponential between Xmin and 
Xmax and we tested several growth functions v(x) for x < 
Xmin and X > Xmax> such as constant (i.e. linear growth) 
and polynomial functions. Figure 3 shows the best fit we 
could obtain. Comparing the experimental data/i shown 
in Figure 3A (Figure 3B for si data) with the reconstructed 
distribution shown in Figure 3C (Figure 3D for 5i data) 
we can see that the Age Model fails to reconstruct the 
experimental age-size distribution and produces a distri- 
bution with a different shape. In particular, its localization 
along the y-axis is very different. For instance, for/i data 
(panels A and C), the red area corresponding to the maxi- 
mum of the experimental distribution is around 2.4 on the 
j-axis whereas the maximum of the fitted distribution is 
around 3.9. The y-axis corresponds to cell size. The size 
distribution produced by the Age Model is thus very dif- 
ferent from the size distribution of the experimental data 
(experimental and fitted size distributions are shown in 
Additional file 1: Figure S9). 

As an additional analysis to strengthen our conclusion, 
we calculated the correlation between the age at divi- 
sion and the size at birth using the experimental data. 
If division is triggered by a timer mechanism, these two 
variables should not be correlated, whereas we found a sig- 
nificant correlation of —0.5 both for si SLndfi data {P < 
10"^^; see Additional file 1: Figure S7). 

We used various growth functions for X < Xmin 

and 

^ > ^max but a satisfying fit could not be obtained with 



the Age Model. In addition, we found that the results 
of the Age Model are very sensitive to the assumptions 
made for the growth law of rare cells of very small and 
large size (see Additional file 1: Figure S3). This ultra- 
sensitivity to hypotheses regarding rare cells makes the 
timer model unrealistic generally for any exponentially 
growing organisms. 

In contrast, the Size Model is in good agreement with 
the data (Figure 3: A compared to E and B compared to 
F) and allows a satisfactory reconstruction of the age-size 
structure of the population. The shape of the experimental 
and fitted distributions as well as their localization along 
the J- axis and ;\:-axis are similar (size distributions and age 
distributions, i.e. projections onto the j-axis and ;^-axis, 
are shown in Additional file 1: Figure S8). 

The quantitative measure of goodness-of-fit defined 
above is coherent with the curves' visual aspects: for the 
Size Model the distance V between the model and the data 
ranges from 17% to 20% for^- data (16% to 26% for 5/ data) 
whereas for the Age Model it ranges from 51% to 93% for 
fi data (45% to 125% for 5/). 

The experimental data has a limited precision. In par- 
ticular, the division time is difficult to determine pre- 
cisely by image analysis and the resolution is limited by 
the time step of image acquisition (for 5/ and fi data, 
the time step represents respectively 5% and 8% of the 
average division time). By performing stochastic simu- 
lations of the Size Model (detailed in Additional file 1: 
Section 6), we evaluated the effect of measurement noise 
on the goodness of fit of the Size Model. We found 
that noise of 10% in the determination of the division 
time leads to a distance V around 14%, which is of the 
order of the value obtained with our experimental data. 
We conclude that the Size Model fits the experimen- 
tal data well. Moreover, we found that in contrast to 
the Age Model, the Size Model is robust with respect 
to the mathematical assumptions for the growth law for 
small and large sizes: the distance V changes by less than 
5%. 

Size control is robust to phenotypic noise 

Noise in the biochemical processes underlying growth 
and division, such as that created by stochastic gene 
expression, may perturb the control of size and affect 
the distribution of cell size. We therefore investigated the 
robustness of size control to such phenotypic noise. The 
Size Model describes the growth of a population of cells 
with variable age and size at division. Nevertheless, it does 
not take into account potential variability in individual 
growth rate or the difference in size at birth between two 
sister cells, i.e. the variability in septum positioning. To 
do so, we derived two PDE models, which are revised 
Size Models with either growth rate or septum posi- 
tioning variability (see Additional file 1: Supplementary 
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Figure 3 Experimental and reconstructed age-size distributions for representative experiments from Stewart etal. [25] (fi) and Wang et 
al. [26] (si). (A,B) Experimental age-size distributions for representative experiments f] (A) and Si (B). Tine frequency of cells of age o and sizes in 
the population is represented by the color at the point of coordinate a on the x-axis and s on they-axis, according to the scale indicated to the right 
of the figure. (C,D) Reconstruction of the distributions using the Age Model (C: reconstruction of the data f] shown in panel A; D: reconstruction of 
the data si shown in panel B). These reconstructed distributions were obtained from simulations with the Age Model using a division rate estimated 
from the data (C: from f] , D: from si ). The growth functions used for the simulations are detailed in the Methods section. (E,F) Reconstruction of the 
distributions using the Size Model (E: reconstruction of the data f] shown in panel A; F: reconstruction of the data Si shown in panel B). These 
distributions were obtained from simulations with the Size Model using a division rate estimated from the data (E: from f] , F: from si ) with an 
exponential growth function (see Methods). 



Text) and ran these models with different levels of 
variability. 

Variability in individual growth rate has a negligible effect on 
the size distribution 

For each single cell, a growth rate can be defined as 
the rate of the exponential increase of cell length with 
time [25,26]. By doing so, we obtained the distribution of 
the growth rate for the bacterial population (Additional 
file 1: Figure S4A). In our dataset this distribution is sta- 
tistically compatible with a Gaussian distribution with 
a coefficient of variation of approximately 8% (standard 
deviation/mean = 0.08). 

We recently extended the Size Model to describe the 
growth of a population with single-cell growth rate vari- 
ability (the equation is given in Additional file 1: Section 5) 
[28]. We simulated this extended Size Model using the 



growth rate distribution oifi data. The resulting size dis- 
tribution is virtually identical to the one obtained without 
growth rate variability (Figure 4A, red and blue lines). 
Therefore, the naturally occurring variability in individual 
growth rate does not significantly perturb the size control. 
To investigate the effect of growth rate variability further, 
we simulated the model with various levels of noise, using 
truncated Gaussian growth rate distributions with coeffi- 
cients of variation from 5 to 60%. We found that to obtain 
a 10% change in size distribution, a 30% coefficient of vari- 
ation is necessary, which would represent an extremely 
high level of noise (Figure 4A, inset). 

Variability in septum positioning has a negligible effect on 
size distribution 

The cells divide into two daughter cells of almost identi- 
cal length. Nevertheless, a slight asymmetry can arise as 
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Size Size 
Figure 4 Influence of the variability in individual growth rate and septum positioning on the cell size distribution. (A) Size distributions 
simulated using the Size Model with the division rate estimated from f] data and an exponential growth iv(x) = vx). In blue: Simulations with the 
same growth rate v = 0.0274 for every cell. Red dashed line: Simulations with individual growth rates distributed according to the experimentally 
observed distribution. Green line: Growth rates normally distributed with coefficient of variation CV = 60%. Inset: Difference (i.e. normalized 
integrated squared error) between the size distribution simulated without variability and the distributions simulated with various levels of variability 
(normally distributed individual growth rates of GV between 1 0% and 60% ). (B) Simulated size distributions using the Size Model with the same 
division rate Bs as in A and a constant growth rate v = 0.0274. In blue: Simulations where division is perfectly symmetrical. Red dashed line: 
Simulations with a variable septum position distributed according to the experimentally observed distribution. Green line: Simulations with a 
normally distributed septum position with GV = 30%. Inset: Difference between the size distribution simulated without variability in the septum 
position and the distributions simulated with various levels of variability (normally distributed septum position of GV between 5% and 30%). GV, 
coefficient of variation. 



an effect of noise during septum positioning. We found 
a 4% variation in the position of the septum (Additional 
file 1: Figure S4B), which is in agreement with previous 
measurements [35,37-39]. To test the robustness of size 
control to noise in septum positioning, we extended the 
Size Model to allow for different sizes of the two sister 
cells at birth (the equation is given in Additional file 1: 
Section 5). We ran this model using the empirical vari- 
ability in septum positioning (shown in Additional file 1: 
Figure S4B) and compared the resulting size distribution 
to the one obtained by simulations without variability. As 
shown in Figure 4B (comparing the red and blue lines), the 
effect of natural noise in septum positioning is negligible. 
We also ran the model with higher levels of noise in sep- 
tum positioning and found that a three times higher (12%) 
coefficient of variation is necessary to obtain a 10% change 
in size distribution (Figure 4B inset, and Additional file 1: 
Figure S5). 

Conclusions 

In the present study, we present statistical evidence to sup- 
port the hypothesis that a size-dependent division rate 
can be used to reconstruct the experimental age-size dis- 
tribution of E. coli. In contrast, this distribution cannot 
be generated by a timer model where the division rate 
depends solely on age. Even though the timer model can 
maintain cell size homeostasis, it is quantitatively incom- 
patible with the observed size distribution. Our analysis of 
two different datasets shows the robustness of our conclu- 
sions to changes in experimental setup and image analysis 
methods. Our results therefore confirm the hypothesis of 
size control of division in E, coli. In addition, our anal- 
ysis of the timer model shows that it is very sensitive 



to mathematical assumptions for the growth law of very 
rare cells of abnormal size, suggesting that this model is 
unrealistic for any exponentially growing organisms. 

Noise in biochemical processes, in particular gene 
expression, can have a significant effect on the precision of 
biological circuits. In particular, it can generate a substan- 
tial variability in the cell cycle [5]. Therefore we investi- 
gated in bacteria the robustness of size control to noise in 
the single-cell growth rate and septum positioning, using 
appropriate extensions of the Size Model. We found that 
variability of the order of what we estimated from E, coli 
data does not significantly perturb the distribution of cell 
size. Therefore, in a natural population exhibiting pheno- 
typic noise, the control of cell size is robust to fluctuations 
in septum positioning and individual growth rates. From 
a modeling perspective, this demonstrates that the simple 
Size Model is appropriate for describing a natural bacterial 
population showing phenotypic diversity. 

Our approach is based on comparisons between PDF 
models and single-cell data for the cell cycle. Such com- 
parisons were attempted a few decades ago using data 
from yeasts (e.g. [21,33]). Nevertheless, these interesting 
studies were hampered by the scarcity and poor qual- 
ity of single-cell data as well as the lack of appropriate 
statistical procedures to estimate the division rate within 
the models. In contrast, we used high-precision measure- 
ments of tens of thousands cells in combination with 
modern statistical inference methods, which allowed us 
to assess quantitatively the adequacy of different models. 
We think this approach could prove successful in study- 
ing other aspects of the cell cycle, such as the coordination 
between replication and division or the molecular mecha- 
nisms underlying size control of division. Several different 
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mechanisms involved in division control in bacteria have 
already been unraveled, in particular MinCD inhibition 
and nucleoid occlusion [40-42]. We believe that a better 
understanding of the relative roles played by MinCD inhi- 
bition and nucleoid occlusion in division control can be 
gained by analyzing the age-size distributions of minCD 
and nucleoid occlusion mutants. We are therefore cur- 
rently performing time-lapse microscopy experiments to 
record the growth of such mutants. 

Methods 

Data analysis 

The data of Stewart et al contain the results of sev- 
eral experiments performed on different days, each of 
them recording the simultaneous growth of several micro- 
colonies of the MG1655 E, coli strain on LB-agar pads at 
30°C, with a generation time of approximately 26 min [25]. 
The first 150 min of growth were discarded to limit the 
effects of non-steady-state growth (cells undergo a slight 
plating stress when put on microscopy slides and it takes 
several generations to recover a stable growth rate). For 
the dataset obtained by Wang et al, the MG1655 E, coli 
strain was grown in LB at 37°C in a microfluidic device 
with a doubling time of approximately 20 min. To avoid 
any effect of replicative aging such as described in [26], we 
only kept the first 50 generations of growth. In addition 
the first ten generations were discarded to ensure steady- 
state growth. Both datasets were generated by analyzing 
fluorescent images (the bacteria express the Yellow Fluo- 
rescent Protein) using two different software systems. For 
Si data, cell segmentation was based on the localization of 
brightness minima along the channel direction (see [26]). 
In the same spirit, iov fi data, local minima of fluores- 
cence intensity were used to outline the cells, following 
by an erosion and dilation step to separate adjacent cells 
(see [25]). To measure its length, a cell was approximated 
by a rectangle with the same second moments of pixel 
intensity and location distribution (for curved cells the 
measurement was done manually). 

For both datasets we extracted data from three experi- 
ments done on different days. We did not pool the data 
together to avoid statistical biases arising from day-to-day 
differences in experimental conditions. Each analysis was 
performed in parallel on the data corresponding to each 
experiment. 

Numerical simulations and estimation procedures 

All the estimation procedures and simulations were 
performed using MATLAB. Experimental age-size 
distributions, such as those shown in Figure 3A,B, were 
estimated from the size and age measurements of every 
cell at every time step using the MATLAB kde2D function, 
which estimates the bivariate kernel density. This esti- 
mation was performed on a regular grid composed of 2^ 



equally spaced points on [O.A^ax] and 2^ equally spaced 
points on [0,X^^^], where Aj^ax is the maximal cell age 
in the data and X^ax the maximal cell size (for instance 
Ayyiax = 60 min and Xj^ax = 10 [im for the experiment /i, 
as shown in Figure 3A). To estimate the size-dependent 
division rate for each experiment, the distribution of 
size at division was first estimated for the cell size grid 
[OyXynax] using the ksdensity function. This estimated 
distribution was then used to estimate for the size grid 
using Equation (20) (for 5/ data) or (22) {for fi data) of 
Additional file 1. The age-size distributions corresponding 
to the Size Model (Figure 3E,F) were produced by running 
the Age & Size Model (Equation (3) in the main text) using 
the estimated division rate and an exponential growth 
function (v(x) = vx) with a rate v directly estimated from 
the data as the average of single-cell growth rates in the 
population (e.g. v = 0.0274 min~^ for the/i experiment 
and V = 0.0317 min~^ for 5i). For the Age & Size Model, 
we discretized the equation along the grid [O.Aynax] 
and [OyXmax]> using an upwind finite volume method 
described in detail in [43]. We used a time step: 



2^xmax(v(;c)) _|_ 2^ 

meeting the CFL: Courant-Friedrichs-Lewy stability 
criterion. We simulated n{t,a,x) iteratively until 
the age-size distribution reached stability {\{n{t + 
dtyUyX) — n{tyayX))\ < 10"^). To eliminate the Malthu- 
sian parameter, the solution n(t, a, oc) was renormalized at 
each time step (for details see [43]). 

The age-dependent division rate for each experi- 
ment was estimated for the cell age grid [O.Amax] using 
Equation (14) and (16) of Additional file 1. Using this 
estimated division rate, the age-size distributions corre- 
sponding to the Age Model (Figure 3C,D) were produced 
by running the Age & Size Model. As explained in the 
main text, we used various growth functions for small and 
large cells (i.e. for x < Xmin and x > Xmax'y between Xmin 
and Xmax growth is exponential with the same rate as for 
the Size Model). For instance for the fit of the experi- 
ment /i shown in Figure 3C, for x < 2.3 \im and x > 
5.3 (im, v(x) = m2ix(p(x), 0), with p(x) = —0.0033;^^+ 
0.036;^^ - 0.094;\: + 0.13. Likewise, for the fit of the exper- 
iment si shown in Figure 3D, for x < 3.5 [im and x > 
7.2 |im, v(x) = m2ix(p(x), 0), with p(x) = -0.0036;c^ + 
0.063:^:-^ — 0.33:\: + 0.67. For each dataset the polynomial 
p(x) was chosen as an interpolation of the function giv- 
ing the length increase as a function of length (shown in 
Figure 2B for/i data). 

Simulations of the extended size models with variabil- 
ity in growth rates or septum positioning (Equations (23) 
and (24) in Additional file 1) were performed as for the 
Age & Size Model, with an upwind finite volume scheme. 
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To simulate Equation (23), we used a grid composed of 2^ 
equally spaced points on [0, Xmax] and 100 equally spaced 
points on [OSvmim l.lv max]^ where Vmin and Vmax are the 
minimal and maximal individual growth rates in the data. 

Additional file 



Additional file 1 : Supplementary text and figures. 
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